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MSPSPTALFCI 
GGAGTCGACCCACGCGTCCGCAGGGCTGAGGAACC ATG TCT CCA TCC CCG ACC GCC CTC TTC TGT CTT 

GLCLGRVPAQSGPLPKPSLO n 
GGG CTG TGT CTG GGG CGT GTG CCA GCG CAG AGT GGA CCG CTC CCC AAG CCC TCC CTC CAG 128 

ALPSSL VPLEKPVTLRCQGP ^ 
GCT CTG CCC AGC TCC CTG GTG CCC CTG GAG AAG CCA GTG ACC CTC CGG TGC CAG GGA CCT 188 

PGVDLY RLEKLSSSRYQDQA 71 
CCG GGC GTG GAC CTG TAC CGC CTG GAG AAG CTG AGT TCC AGC AGG TAC CAG GAT CAG GCA 248 

VLFIPAMKRSLAGRYRCSYQ 91 
GTC CTC TTC ATC CCG GCC ATG AAG AGA AGT CTG GCT GGA CGC TAC CGC TGC TCC TAC CAG 308 

NGSLWSLPSDQLELVATGVF111 
AAC GGA AGC CTC TGG TCC CTG CCC AGC GAC CAG CTG GAG CTC GTT GCC ACG GGA GTT TTT 368 

AKPS LSAQPGPAVSSGGDVT131 
GCC AAA CCC TCG CTC TCA GCC CAG CCC GGC CCG GCG GTG TCG TCA GGA GGG GAC GTA ACC 428 

h. QCCjTRYGFDQFALYKEGDP 151 
CTA CAG TGT CAG ACT CGG TAT GGC TTT GAC CAA TTT GCT CTG TAC AAG GAA GGG GAC CCT 488 

APY KNPERWYRASFPIITVT171 
GCG CCC TAC AAG AAT CCC GAG AGA TGG TAC CGG GCT AGT TTC CCC ATC ATC ACG GTG ACC 548 

AAH SGTYRCYSFSSRDPYLW191 
GCC GCC CAC AGC GGA ACC TAC CGA TGC TAC AGC TTC TCC AGC AGG GAC CCA TAC CTG TGG 608 

SAPSDPLELVVTGTSVTPSR211 
TCG GCC CCC AGC GAC CCC CTG GAG CTT GTG GTC ACA GGA ACC TCT GTG ACC CCC AGC CGG 668 

LPTEPPSSVAEFSEATAELT 231 
TTA CCA ACA GAA CCA CCT TCC TCG GTA GCA GAA TTC TCA GAA GCC ACC GCT GAA CTG ACC 728 

VSFTN KVFTTETSRSITTSP251 
GTC TCA TTC ACA AAC AAA GTC TTC ACA ACT GAG ACT TCT AGG AGT ATC ACC ACC AGT CCA 788 

KESDSp AGPARQYYTKGNLV271 
AAG GAG TCA GAC TCT CCA GCT GGT CCT GCC CGC CAG TAC TAC ACC AAG GGC AAC CTG GTC 848 

RICL GAVILIILAGFLAEDW291 
CGG ATA TGC CTC GGG GCT GTG ATC CTA ATA ATC CTG GCG GGG TTT CTG GCA GAG GAC TGG 908 

HSRRKRLRHRGRAVQRPLPP 311 
CAC AGC CGG AGG AAG CGC CTG CGG CAC AGG GGC AGG GCT GTG CAG AGG CCG CTT CCG CCC 968 
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LPPLPQTRKSHGG QDGGRQD331 
CTG CCG CCC CTC CCG CAG ACC CGG AAA TCA CAC GGG GGT CAG GAT GGA GGC CGA CAG GAT 1028 

VHSRGLCS* 340 
GTT CAC AGC CGC GGG TTA TGT TCA TGA 10 55 

CCGCTGMCCCCAGGCACGGTCCTATCCMGG^ n34 

CGTGGAAGCAGGAGGGCAGAGGCTACAGCTGTGGAMCGAGGCCATGCTGCCTCCTCCTGGTGTTCCATCAGGG^ 1213 
TTCGGCCAGTGTCTGTCTGTCTGTCTGCCTCTCTGTCTGAGGGCACCCTCCATTTGGGATGGAAGGAAT^ 1292 
CCCATCCTCCTCCCTGCACACTGTGGATGACATGGTACCCTGGCTGGACCACATACTGGCCTCTTTCTTCMCCTCT^ 1371 
MTATGGGCTCCAGACGGATCTCTMGGTTCCCAGCTCTCAGGGTTGACTCTGTTCCATCCTCTGTGCAAMTCCTCC^ 1450 
GTGCTTCCCTTTGGCCCTCTGTGCTC^ 152 9 

TMCAMTCTCCTTTCGTCTCTCAGMCGGGTCTTGCAGGCAGTTTGGGTATGTCATTCATT^ 1608 

AGCACGTTGCCCGCTTCCCTTCACATTAGAAMCMGATCAGCCTGTGCMCATGGTGAMCCTCATCTCT^^ 1687 

MCAAAAAMCACAAAMTTAGCCAGGTGTGGTGGTGCATCCCTATACTCCCAGCMCTCGGGGGG^ 1766 

ATGGCTTGAGCCTGGGAGGICAGAGGTTGCACT 1845 

CCTTGTCTCAAAAMTACAGGGATGMTATGTCM 1924 

ATTGCTGTCC^CCCCATAMTATGTACMnATGTATACATTmAAMTCATAAAM 2003 

AAAAAAAAAAAAAAGGGCGGGCCGCTAGACTAGTCTAGAGAACA 2047 

FIG.1B 
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1 41 81 121 161 201 241 281 321 



MSPSPTALFCLGLCLGRVPAQSGPLPKPSLQALPSSLVPLEKPVTLRCQGPPGVDLYRLE 
KLSSSRYQDQAVLF I PAMKRSLAGRYRCSYQNGSLWSLPSDQLELVATGVFAKPSLSAQP 
GPAVSSGGDVTLXQTRYGFDQFAL YKEGDPAPYKNPERWYRASFP 1 1 TVTAAHSGTYRC 
YSFSSRDPYLWSAPSDPLELWTGTSVTPSRLPTEPPSSVAEFSEATAELTVSFTNKVFT 
TETSRS I TTSPKESDSPAGPARQYYTKGNLVRICLGAVI L 1 1 LAGFLAEDWHSRRKRLRH 
RGRAVQRPLPPLPPLPQTRKSHGGQDGGRQDVHSRGLCS 



FIG.2 



Docket No.: 7853-211-999 

Serial No.: 09/610,118 
Inventor(s): BUSFIELD et al. 
Title: "GLYCOPROTEIN 
VI AND USES THEREOF" 



10 20 30 40 50 60 70 

inputs ATGACGCCCGCCCTCACAGCCCTGCTCTGCCTTGGGCTGAGTCTGGGCCCCAGGACCCGCGTGCAGGCAG 

ATGTCTCCATCCCCGACCGCCCTCTTCTGTCTTGGGCTGTGTCTGGGGCG - TGTGCC AGC - - GC AGAGTG 
10 20 30 40 50 60 

80 90 100 110 120 130 

1 nput s GGCCCTTCCCCAAACCCACCCTCTGGGCTGAGCC AGGCTCTGTGAT - CAGCTGGGGGAGCCCCGTGACC A 

*. I * t I . 1*11* ... .... .. . . . ...... 

GACCGCTCCCCMGCCCTCCCTCCAGGCTCTGCCCAGCTCCCTGGTGCCCCTGGAGMGCCA-GTGACCC 
70 80 90 100 110 120 130 

140 150 160 170 180 190 200 

i nputs TCTGGTGTCAGGGGAGCCTGGAGGCCCAGGAGTACCGACTGGATAAAGAGGGAAGCCCAGAGCCCTTGGA 

... ...... ... . ... ... 

■•••••••••» •••• **...... ... ... ....... ... 

TCCGGTGCCAGGG- -ACCT CCGGGCGTG- - GACCTGTA CCGCCTGGAG AAG 

140 150 160 170 180 

210 220 230 240 250 260 270 

inputs CAGAAATAACCCACTGGAACCCAAGAACAAGGCCAGATTCTCCATCCCATCCATGACAGAGCACCATGCG 



CTGAGTT - - CCAGCAGGTACC - AGGATC A - GGCAGTCCTCTTCATCCCGGCCATGAAGAGAAGTCTGGCT 
190 200 210 220 230 240 

280 290 300 310 320 330 340 

i nputs GGGAGATACCGCTGCCACTATTACAGCTCTGCAG - - GCTGGTCAGAGCCCAGCGACCCCCTGGAGCTGGT 

GGACGCTACCGCTGCTCCTAC - - CAGAACGGAAGCCTCTGGTCCCTGCCCAGCGACCAGCTGGAGCTCGT 
250 260 270 280 290 300 310 

350 360 370 380 390 400 410 

inputs GATGACAGGATTCTACAACAAACCCACCCTCTCAGCCCTGCCCAGCCCTGTGGTGGCCTCAGGGGGGAAT 

i ; m I::;::::;:.::::.:::: ■ r. 

TGCCACGGGAGTTTTTGCCAAACCCTCGCTCTCAGCCCAGCCCGGCCCGGCGGTGTCGTCAGGAGGGGAC 
320 330 340 350 360 370 380 

420 430 440 450 460 470 480 

i nputs ATGACCCTCCGATGTGGCTCACAGAAGGGATATCACCATTTTGnCTGATGAAGGAAGGAGAACACCAGC 



GTAACCCTACAGTGTCAGACTCGGTATGGCTTTGACCAATTTGCTCTGTACAAGGAAGG 
390 400 410 420 430 440 



490 500 510 520 530 540 550 

inputs TCCCCCGGACCCTGGACTCACAGCAGCTCCACAGTGGGGGGTTCCAGGCCCTGTTCCCTGTGGGCCCCGT 



■GGACCCTG C- - -GCCCTA CAA 

450 460 
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560 570 580 590 600 610 620 

i nputs GAACCCCAGCCACAGGTGGAGGTTCACATGCTATTACTATTATATGAACACCCCCCAGGTGTGGTCCCAC 
•••••• ••••• ■•»•« 

GAATCCCGA GAGATGGTAC - CGGGCTAGT - - - -TT CCCCAT CAT 

470 480 490 500 

630 640 650 660 670 680 690 

inputs CCCAGTGACCCCCTGGAGATTCTGCCCTCAGGCGTGTCTAGGAAGCCCTCCCTCCTGACCCTGCAGGGCC 



CACGGTGACCGCC GCCCACAG ■ 

510 520 



700 710 720 730 740 750 760 

inputs CTGTCCTGGCCCCTGGGCAGAGCCTGACCCTCCAGTGTGGCTCTGATGTCGGCTACGACAGATTTGTTCT 



CGGAACCTA CCGATG CTACAGC TTCT 

530 540 550 

770 780 790 800 810 820 830 

inputs GTATAAGGAGGGGGAACGTGACTTCCTCCAGCGCCCTGGCCAGCAGCCCCAGGCTGGGCTCTCCCAGGCC 



■CCAGCAG- 



840 850 860 870 880 890 900 

inputs AACTTCACCCTGGGCCCTGTGAGCCCCTCCCACGGGGGCCAGTACAGGTGCTATGGTGCACACAACCTCT 



GGACCCA TACCT-- 

560 

910 920 930 940 950 960 970 

inputs CCTCCGAGTGGTCGGCCCCCAGCGACCCCCTGAACATCCTGATGGCAGGACAGATCTATGACACCGTCTC 



GTGGTCGGCCCCCAGCGACCCCCTGGA GCT TGTG 

570 580 590 600 

980 990 1000 1010 1020 1030 1040 

inputs CCTGTCAGCACAGCCGGGCCCCACAGTGGCCTCAGGAGAGAACGTGACCCTGCTGTGTCAGTCATGGTGG 



- - - GTCA CAGGMCCTCTGTGACC CCCAGC CGGT 

610 620 630 

1050 1060 1070 1080 1090 1100 1110 

1 nputs CAGTTTGACACTTTCCTTCTGACCAAAGAAGGGGCAGCCCATCCCCCACTGCGTCTGAGATCAATGTACG 



TACCAACAGAAC CA - - CCTTCC TCG 

640 650 

1120 1130 1140 1150 1160 1170 1180 

inputs GAGCTCATAAGTACCAGGCTGAATTCCCCATGAGTCCTGTGACCTCAGCCCACGCGGGGACCTACAGGTG 



GTA GCAGAATTCTC AGAAGCCAC CGCTGA ACTG- - A 

660 670 680 690 

FIG.3B 
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1190 1200 1210 1220 1230 1240 1250 

inputs CTACGGCTCATACAGCTCCAACCCCCACCTGCTGTCTTTCCCCAGTGAGCCCCTGGAACTCATGGTCTCA 

C- -CGTCTCATTCA- - -CAAAC AAAGtCTT- -CACAA CtGAGACT TCf - - 

700 710 720 730 

1260 1270 1280 1290 1300 1310 1320 

inputs GGACACTCTGGAGGCTCCAGCCTCCCACCCACAGGGCCGCCCTCCACACCTGGTCTGGGAAGATACCTGG 



AGGAGTATC - - ACCACCAGTCCAAAGGA - - GTCAGACTCTCCAG - -CTGG 

740 750 760 770 

1330 1340 1350 1360 1370 1380 1390 

inputs AGGTTTTGATTGGGGTCTCGGTGGCCTTCGTCCTGCTGCTCTTCCTCCTCCTCTTCCTCCTCCTCCGACG 



TCCTGC CCGCCAGTA CTACACCAAGG 

780 790 800 

1400 1410 1420 1430 1440 1450 1460 

inputs TCAGCGTCACAGCAAACACAGGACATCTGACCAGAGAAAGACTGATTTCCAGCGTCCTGCAGGGGCTGCG 

GCAAC CTGGtC CGGAtAT - - - GCCTC GGGGCTG - - 

810 820 830 

1470 1480 1490 1500 1510 1520 1530 

i nputs GAGACAGAGCCCAAGGACAGGGGCCTGCTGAGGAGGTCCAGCCCAGCTGCTGACGTCCAGGAAGAAAACC 



TGATCCTAATAA TCCTG - - GCGGGGTTTCTG GCAGA- GGACTGG C 

840 850 860 870 

1540 1550 1560 1570 1580 1590 1600 

i nputs TCTATGCTGCCGTGAAGGAC ACACAGTCTGAGG - AC AGGGTGGAGCTGGACAGT - CAGAGCCCAC ACGAT 



AC AGCCG- -GAGGAAGCGC- - - CTGCGGCACAGGG GCAGGGCTGTGCAGAGGCCGCT 

880 890 900 910 920 

1610 1620 1630 1640 1650 1660 1670 

inputs GAAGACCCCCAGGCAGTGACGTATGCCCCGGTGAAACACTCCAGTCCTAGGAGAGAAATGGCCTCTCCTC 

TCC GCCCCTG CCGC C 

930 940 

1680 1690 1700 1710 1720 1730 1740 

i nputs CCTCCTCACTGTCTGGGGAATTCCTGGACACAAAGGACAGACAGGTGGAAGAGGACAGGCAGATGGACAC 



CCTCC-CGCAGAC CCGGAAATCA CA- -CGGG GGTCAGG- - -ATGGA- - - 

950 960 970 980 

1750 1760 1770 1780 1790 1800 1810 

i nput s TGAGGCTGCTGCATCTGAAGCCTCCCAGGATGTGACCTACGCCC AGCTGCACAGCTTGACCCTTAGACGG 



- - - GGC CGAC AGGATGTT CACAGC CG - 

990 1000 

1820 1830 1840 1850 1860 1870 1880 

inputs AAGGCAACTGAGCCTCCTCCATCCCAGGAAGGGGAACCTCCAGCTGAGCCCAGCATCTACGCCACTCTGG 



■CGGGTTATG TTCA- 

1010 



1890 
inputs CCATCCAC 



FIG.3C 
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10 20 30 40 50 60 
i nputs MSPSPTALFCLGLCLG- RVPAQSGPLPKPSLQALPSSLVPLEKPVTLRCQGPPGVDLYRLEKLSSS 

MtPALTALLCLGLSLGPRTRVQAGP^ 

10 20 30 40 50 60 70 

70 80 90 100 110 120 130 

i nputs RYQ DQAVLFIPAMKRSLAGRYRCSYQNGSLWSLPSDQLELVATGVFAKPSLSAQPGPAVSSGGDV 

RNNPLEPKN^RFSIPSMTEHHAGRYRCH 

80 90 100 110 120 130 140 



inputs TLQCQT RY- 



TLRCGSQKGYHHFVLMKEGEHQLPRTLDSQQLHSGGFQALFPVGPVNPSHRWRFTCYYYYMNTPQVWSHP 
150 160 170 180 190 200 210 

140 150 

inputs GFDQFALYKEGDP 



SDPLEILPSGVSRKPSLLTLQGPVLAPGQSLTLQCGSDVGYDRFVLYKEGERDFLQRPGQQPQAGLSQAN 
220 230 240 250 260 270 280 

160 

inputs APYK NP ERW-- 

FTLGPVSPSHG(^YRCYGAHNLSSEWSAPSDPLNILMAGQIYDTVSLSAQPGPTVASGENVTLLCQSWWQ 
290 300 310 320 330 340 350 

170 180 190 200 

i nputs YRASFPI ITVTAAHSGTYRCYSFSSRDPYLWSAPSDPLELVVTG 



FDTFLLTKEGAAHPPLRLRSMYGAHKYQAEFPMSPVTSAHAGTYRCYGSYSSNPHLLSFPSEPLELMVSG 
360 370 380 390 400 410 420 

210 220 230 240 250 260 

inputs TSVTPSRLPTEPPSS- - VAEFSEATAELTVSFTNKVF TTETSRSITTSPKESD- -SPAGPA- 



HSGGSSLPPTGPPSTPGLGRYLEVLIGVSVAFVLLLFLLLFLLLRRQRHSKHRTSDQRKTDFQRPAGAAE 
430 440 450 460 470 480 490 

270 280 290 
inputs RQYYTKGNLVRICLGAVIL IILAGFLAEDW '- - -HSRRKR 



TEPKDRGLLRRSSPAADVQEENLYAAVKDTQSEDRVELDSQSPHDEDPQAVTYAPVKHSSPRREMASPPS 
500 510 520 530 540 550 560 

300 310 320 330 
inputs LRHRGRAVQ- -RPL PPLPPLPQTRK SHGGQDGGRQDVHSRGLC 

SLSGEFLDTKDRQVEEDRQMDTEAMSEASQDVTYA 

570 580 590 600 610 620 630 

inputs S 

FIG. 4 
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*- >GesvtLtCsvsgf gppgvsvtWyf kngk . 1 gpsll gysysrl esgek 
+ vtl_+C+ + v y + k ++ r++ + 

hT268 41 EKPVTLRCQGP PGVDLY-RLEK1 SSS RYQDQ- - 70 

anl segrf si ssl tLti ssvekeDsGtYtCvv<-* 
++L i +++ +G Y+C 
hT268 71 AVLFIPAMKRSLAGRYRCSY 90 

FIG.5A 



*- >GesvtLtCsvsgf gppgvsvtWyf kngk . 1 gpsl 1 gysysrl esgek 
G++vtL+C+++ + ++ y k+g++ + y+++ 
hT268 127 GGDVTLQCQTR- - -YGFDQFALY-KEGDpAP YKNPERWYR- - 162 

anl segrf si ssl tLti ssvekeDsGtYtCvv<-* 
++++i++v++ sGtY+C 
hT268 163 ASFP I ITVTAAHSGTYRCYS 182 

FIG.5B 
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M S P A 4 

GAGTCGACCCACGCGTCCGCTTCCCTGCTTGGCCACATAGCTCAGGACTGGGTTGCA6AACC ATG TCT CCA GCC 74 

SPTFFCIGLCVLQVIQTQSG 24 

TCA CCC ACT TTC TTC TGT ATT GGG CTG TGT GTA CTG CAA GTG ATC CAA ACA CAG AGT GGC 134 

PLPKPSLQAQPSSLVPLGQS 44 

CCA CTC CCC AAG CCT TCC CTC CAG GCT CAG CCC AGT TCC CTG GTA CCC CTG GGT CAG TCA 194 

VILRCQGPPDVDLYRLEKLK 64 

GTT ATT CTG AGG TGC CAG GGA CCT CCA GAT GTG GAT TTA TAT CGC CTG GAG AAA CTG AAA 254 

PEKYEDQDFLFIPTMERSNA 84 

CCG GAG AAG TAT GAA GAT CAA GAC TTT CTC TTC ATT CCA ACC ATG GAA AGA AGT AAT GCT 314 

GRYRCSYQNGSHWSLPSDQL 104 

GGA CGG TAT CGA TGC TCT TAT CAG AAT GGG AGT CAC TGG TCT CTC CCA AGT GAC CAG CTT 374 

ELIATGVYAKPSLSAHPSSA 124 

GAG CTA ATT GCT ACA GGT GTG TAT GCT AAA CCC TCA CTC TCA GCT CAT CCC AGC TCA GCA 434 

VPQGRDVTLKCQSPYSFDEF 144 

GTC CCT CAA GGC AGG GAT GTG ACT CTG AAG TGC CAG AGC CCA TAC AGT TTT GAT GAA TTC 494 

VLYKEGDTGPYKRPEKWYRA 164 

GTT CTA TAC AAA GAA GGG GAT ACT GGG CCT TAT AAG AGA CCT GAG AAA TGG TAC CGG GCC 554 

NFP I ITVTAAHSGTYRCYSF 184 

AAT TTC CCC ATC ATC ACA GTG ACT GCT GCT CAC AGT GGG ACG TAC CGG TGT TAC AGC TTC 614 

SSSSPYLWSAPSDPLVLVVT 204 

TCC AGC TCA TCT CCA TAC CTG TGG TCA GCC CCG AGT GAC CCT CTA GTG CTT GTG GTT ACT 674 

GLSATPSQVPTEESFPVTES 224 

GGA CTC TCT GCC ACT CCC AGC CAG GTA CCC ACG GAA GAA TCA TTT CCT GTG ACA GAA TCC 734 

SRRPSILPTNKISTTEKPMN 244 

TCC AGG AGA CCT TCC ATC TTA CCC ACA AAC AAA ATA TCT ACA ACT GAA AAG CCT ATG AAT 794 

ITASPEGLSPPIGFAHQHYA 264 

ATC ACT GCC TCT CCA GAG GGG CTG AGC CCT CCA ATT GGT TTT GCT CAT CAG CAC TAT GCC 854 

KGNLVRICLGATI I I I LLGL 284 

AAG GGG AAT CTG GTC CGG ATA TGC CTT GGT GCC ACG ATT ATA ATA ATT TTG TTG GGG CTT 914 

LAEDWHSRKKCLQHRMRALQ 304 

CTA GCA GAG GAT TGG CAC AGT CGG AAG AAA TGC CTG CAA CAC AGG ATG AGA GCT TTG CAA 974 

RPLPPLPLA* 314 

AGG CCA CTA CCA CCC CTC CCA CTG GCC TAG 1004 

AMTMCTTGGCTTTCAGCAGAGGGATTGACCAGACATCCATGCACMCCATGGACATCACCACTAGAGCCACAGACA^ 1083 

GGACATACTCMGACTGGGXaAGGTTATATAA 1162 

A 1163 

FIG. 6 
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Out 



I . | . | . | i | . | . | . | . | i | i , . | i | . | . | . | i 
1 41 81 121 161 201 241 281 



MSPASPTFFC IGLCVLQV I QTQSGPLPKPSLQAQPSSLVPLGQSV I LRCQGPPDVDLYRL 
EKLKPEKYEDQDFLF I PTMERSNAGRYRCSYQNGSHWSLPSDQLEL I ATGVYAKPSLSAH 
PSSAVPQGRDVTLKCQSPYSFDEFVLYKEGDTGPYKRPEKWYRANFP 1 1 TVTAAHSGTYR 
CYSFSSSSPYLWSAPSDPLVLWTGLSATPSQVPTEESFPVTESSRRPS I LPTNK I STTE 
KPMN I TASPEGLSPP IGFAHQHYAKGNLVR I CLGAT 1 1 1 1 LLGLLAEDWHSRKKCLQHRM 
RALQRPLPPLPLA 



FIG.7 
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10 20 30 40 50 60 70 

inputs ATGACGCCCGCCCTCACAGCCCTGCTCTGCCTTGGGCTGAGTCTGGGCCCCAGGACCCGCGTGCAGGCAG 



ATGTCTCCAGCC - TCAC - -CC- - - -ACTTTCTT- - -CTGTAT- 
10 20 30 



80 90 100 110 120 130 140 

inputs GGCCCTTCCCCAMCCCACCCTCTGGGCTGAGCCAGGCTCTGTGATCAGCTGGGGGAGCCCCGTGACCAT 



TGGGCTG TGTGTACTGC 

40 

150 160 170 180 190 200 210 

inputs CTGGTGTCAGGGGAGCCTGGAGGCCCAGGAGTACCGACTGGATAAAGAGGGAAGCCCAGAGCCCTTGGAC 



AAGTGATCC AAACACAGAG - - - -TGG- - 

50 60 70 

220 230 240 250 260 270 280 

i nputs AGAAATAACCCACTGGAACCCAAGAACAAGGCCAGATTCTCCATCCCATCCATGACAGAGCACCATGCGG 



CCCACT - - - - CCC CAAG CCTTCCC - TCCAGG 

80 90 

290 300 310 320 330 340 350 

inputs GGAGATACCGCTGCCACTATTACAGCTCTGCAGGCTGGTCAGAGCCCAGCGACCCCCTGGAGCTGGTGAT 



CTCAGCC CAGTTCCCTG - GTACCCCTGGGTCAG 

100 110 120 

360 370 380 390 400 410 420 

inputs GACAGGATTCTACAACAAACCCACCCTCTCAGCCCTGCCCAGCCCTGTGGTGGCCTCAGGGGGGAATATG 

-TCAG- -TTATTC TGAGGTG-C- -CAGGGA 

130 140 150 

430 440 450 460 470 480 

i nputs ACCCTCC - GATGTGGCTCACAGMGGGATATCACCATTTTGTTCTGATGAAGGAAGGAGAACACCAGCTC 



- - CCTCCAGATGTGG ATTTATATCGCCTGGAGAAACTGAAA 

160 170 180 190 

490 500 510 520 530 540 550 

inputs CCCCGGACCCTGGACTCACAGCAGCTCCACAGTGGGGGGTTCCAGGCCCTGTTCCCTGTGGGCCCCGTGA 

• • • . ... .. ... 

■ -CCGGA GA AGTATGAAGATCAAGAC - - -TTTCTCTT CATT- 

200 210 220 
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560 570 580 590 600 610 620 

inputs ACCCCAGCCACAGGTGGAGGTTCACATGCTATTACTATTATATGAACACCCCCCAGGTGTGGTCCCACCC 

- - - CC AACCATGGAAAGAAGTA - - -ATGCT GGAC GGTAT 

230 240 250 260 

630 640 650 660 670 680 690 

inputs CAGTGACCCCCTGGAGATTCTGCCCTCAGGCGTGTCTAGGAAGCCCTCCCTCCTGACCCTGCAGGGCCCT 

CGATG- - -CTCTTA TCAGA ATGGGAGTC ACTGGTCTCT 

270 280 290 

700 710 720 730 740 750 760 

inputs GTCCTGGCCCCTGGGCAGAGCCTGACCCTCCAGTGTGGCTCTGATGTCGGCTACGACAGATTTGTTCTGT 

• • • • • * ■ •••• 

CCCAAG TGACCAGCTTGAG CTAATT- - -GCTAC 

300 310 320 



770 780 790 800 810 820 830 

inputs ATAAGGAGGGGGAACGTGACTTCCTCCAGCGCCCTGGCCAGCAGCCCCAGGCTGGGCTCTCCCAGGCCAA 



- - AGGTGTGTATGCTAAAC - - CCTC ACTCTC ■ 

330 340 350 



840 850 860 870 880 890 900 

inputs CTTCACCCTGGGCCCTGTGAGCCCCTCCCACGGGGGCCAGTACAGGTGCTATGGTGCACACAACCTCTCC 

■ •■■ 

• •*»•*••• •■■ 

AGCTCATCCCA GCT 

360 

910 920 930 940 950 960 970 

inputs TCCGAGTGGTCGGCCCCCAGCGACCCCCTGAACATCCTGATGGCAGGACAGATCTATGACACCGTCTCCC 

CAGCAGTCCC TC- - -AAGGCAGG- - -GAT- -GTGACTCTGA 

370 380 390 400 

980 990 1000 1010 1020 1030 1040 

inputs TGTCAGCACAGCCGGGCCCCACAGTGGCCTCAGGAGAGAACGTGACCCTGCTGTGTCAGTCATGGTGGCA 

AGT GCCAGAGCCCATA CAGTTTTGATGA - - 

410 420 

1050 1060 1070 1080 1090 1100 1110 

i nputs GTTTGACACTTTCCTTCTGACCAAAGAAGGGGCAGCCCATCCCCCACTGCGTCTGAGATCAATGTACGGA 

• ■• "***« ••••••••••• • ■* > • • • • 

ATTCGTTCTATACAAAGAAGGGG AT ACTGGGCCTTATA- -AGAGACCTGA 

430 440 450 460 470 

FIG.8B 
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1120 1130 1140 1150 1160 1170 1180 

inputs GCTCATAAGTACCAGGCTGAATTCCCCATGAGTCCTGTGACCTCAGCCCACGCGGGGACCTACAGGTGCT 
• * ■■• • • ■ * *■ ■•• ■ * * • 

G - - AAATGGTACCGGGCCAATTTCCCCATCATCACA 

480 490 500 510 520 530 540 

1190 1200 1210 1220 1230 1240 1250 

inputs ACGGCTCATACAGCTCCAACCCCCACCTGCTGTCTTTCCCCAGTGAGCCCCTGGAACTCATGGTCTCAGG 



ACAGCTTCTCCAGCTCATCTCCATACCTGTGGTCAGCCCCGAGTGACCCTCTAGTGCTTGTGGTTACTGG 
550 560 570 580 590 600 610 

1260 1270 1280 1290 1300 1310 1320 

inputs ACACTCTGGAGGCTCCAGCCTCCCACCCACAGGGCCGCCCTCCACACCTGGTCTGGGAAGATACCTGGAG 

ACTCTCTG CCA- - CTCCCAGCC - - AGGT - -ACCCAC GGA-AGAATCATTTCCTG- - ■ 

620 630 640 650 660 

1330 1340 1350 1360 1370 1380 1390 

inputs GTTTTGATTGGGGTCTCGGTGGCCTTCGTCCTGCTGCTCTTCCTCCTCCTCTTCCTCCTCCTCCGACGTC 

*** * ■ **• •** » ***** ■■* ** 

TGA CAGAATCCT CCAGGAGACCTTCCA TCTTAC CCACAAACAAA 

670 680 690 700 

1400 1410 1420 1430 1440 1450 1460 

inputs AGCGTCACAGCAAACACAGGACATCTGACCAGAGAAAGACTGATTTCCAGCGTCCTGCAGGGGCTGCGGA 

A- - -TATCTACAA- - -CTGAA AAGCCTATGAATATC - - ACTGCCT - C - TCC AG - AGGGGCTG 

710 720 730 740 750 

1470 1480 1490 1500 1510 1520 1530 

inputs GACAGAGCCCAAGGACAGGGGCCTGCTGAGGAGGTCCAGCCCAGCTGCTGACGTCCAGGAAGAAAACCTC 



AGCCCT CC AATTGGTTTTGCTCATCAGCA C 

760 770 780 

1540 1550 1560 1570 1580 1590 1600 

inputs TATGCTGCCGTGAAGGACACACAGTCTGAGGACAGGGTGGAGCTGGACAGTCAGAGCCCACACGATGAAG 



TATGC CAAGGGGAATCTGGTC CGGATATG 

790 800 810 

1610 1620 1630 1640 1650 1660 1670 

i nput s ACCCCCAGGC AGTGACGTATGCCCCGGTGAAACACTCCAGTCCTAGGAGAGAAATGGCCTCTCCTCCCTC 

- - -CCTTGG TGCCACGAT TATAATAATTTTGT 

820 830 840 

1680 1690 1700 1710 1720 1730 1740 

i nputs CTCACTGTCTGGGGAA7TCCTGGACACAAAGGACAGACAGGTGGAAGAGGACAGGCAGATGGACACTGAG 

TGGGGCTT- -CTAG- - • CAGAGGATTGGC ACAGTCGGAAGAA AT 

850 860 870 880 
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1750 1760 1770 1780 1790 1800 1810 

i nputs GCTGCTGCATCTGAAGCCTCCCAGGATGTGACCTACGCCCAGCTGCACAGCTTGACCCTTAGACGGAAGG 

...... 

GC - - CTGCAAC A CAGGATGAGA GCTTTGC AAAGG 

890 900 910 

1820 1830 1840 1850 1860 1870 1880 

inputs CAACTGAGCCTCCTCCATCCCAGGAAGGGGAACCTCCAGCTGAGCCCAGCATCTACGCCACTCTGGCCAT 

•••• .... 

CCACTA CCACC CCTCC CACTGGCC - - 

920 930 

1890 
inputs CCAC 
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10 20 30 40 50 60 

inputs MSPASPTFFCIGLCVLQVIQTQSGPLPKPSLQAQPSSLVPLGQSVILRCQGPPDVDLYRLEKL-KPEKYE 

MTPALTALLCLGLSLGPRTRVQAGPFPKPT 

10 20 30 40 50 60 70 

70 80 90 100 110 120 130 

i inputs DQDFL F- IPTMERSNAGRYRCSYQNGSHWSLPSDQLELIATGVYAKPSLSAHPSSAVPQGRDV 

rnnplepknkarfsipsmtehi^gryrchWssagwsepsd 

80 90 100 110 120 130 140 



inputs TLKC--QSPY- 



tlrcgsqkgyhhfvlmkegehqlprtldsqqlhsggfqalfpvgpvnpshrwrftcyyyymntpqvwshp 

150 160 170 180 190 200 210 

140 150 

inputs SFDEFVLYKEGD 



sdpleilpsgvsrkpslltlqgpvlapgqsltlqcgsdvgydrfvlykegerdflqrpgqqpqaglsqan 

220 230 240 250 260 270 280 

160 

inputs TGPYK RP ekW-- 

FTLGPVSPSHGGQYRCYGAHNLSSEWSAPSDPLNILMAGQIYDTVSLSAQPGPTVASGENVTLLCQSWWQ 
290 300 310 320 330 340 350 

170 180 190 200 

inputs YRANFP I ITVTAAHSGTYRCYSFSSSSPYLWSAPSDPLVLVVTG 

r . i . ; ; . i i . i r . i *" * 

FDTFLLTKEGMHPPLRLRSMYGAHKYQAEFPMSPVTSAI^^ 

360 370 380 390 400 410 420 

210 220 
inputs LSATPSQVPTEES FPV 



HSGGSSLPPTGPPSTPGLGRYLEVLIGVSVAFVLLLFLLLFLLLRRQRHSKHRTSDQRKTDFQRPAGAAE 
430 440 450 460 470 480 490 

230 240 250 260 270 

i nput TESS RRPS I LPTNKISTTEKPMNI - TASPEGLSP -PIGFAH- - QHYAKGNLVR - - 1 



TEPKDRGLLRRSSPAADVQEENLYAAVKDTQSEDRVELDSQSPHDEDPQAVTYAPVKHSSPRREMASPPS 
500 510 520 530 540 550 560 

280 290 300 310 

i nputs CLGATI 1 1 1 LLGLLAEDWH SRKKC LQHRMRALQRP L PP LPL 

: . ... 

SLSGEFLDTKDRQVEEDRQMDTEAMSEASQDVT^ 

570 580 590 600 610 620 630 

inputs A 
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*->GesvtLtCsvsgfgppgvsvtWyfkngk Jgpsllgysysrlesgek 
G+sv L+C+ ++v y + k ++ +++ e + 

mT268 42 GQSVILRCQGP PDVDLY-RLEK1KP EKYEDQ-- 71 

anl segrf si ssl tLti ssvekeDsGtYtCvv<-* 
L i + e++++G Y+C 
mT268 72 DFLFIPTMERSNAGRYRCSY 91 

FIG.10A 



*- >GesvtLtCs vsgf gppgvs vtWyf kngk . 1 gpsl 1 gysysrl esgek 
G +vtL C++ ++ y k+g++ + Y+r+e + 

mT268 128 GRDVTLKCQSP - - -YSFDEFVLY-KEGDtGP YKRPEKW-Y 162 

anl segrf si ssl tLti ssvekeDsGtYtCvv<-* 
+ ++i++v++ sGtY+C 

mT268 163 RA NFPI ITVTAAHSGTYRCYS 183 
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10 20 30 40 50 60 

i nputs [MSPSPTALFCLGLCLGRV-P^QSGPLPKPSLQALPSSLVPLEKPVTLRCQGPPGVDLYRLEKLSSSRYQD 



MSPASPTFFCIGLCVLQVIQTp SGPLPKPSLQAQPSSLVPLGQSVILRCQGPPDVDLYRLEKLKPEKYED 
10 20 30 40 50 60 70 

70 80_ 90 100 110 120 130 

inputs QAVLFIPAMKRSLAGRYRCSYQNGSLWSLPSDQLELVATGVFAKPSLSAQPGPAVSSGGDVTLQCQTRYG 

QDFLFIPTMERSNAGRYRCSYQNGSHWSLPSDQLELIATGVYAKPSLSAHPSSAVPQGRDVTLKCQSPYS 
80 90 100 110 120 130 140 
1 40 150 160 170 180 190 200 
i nputs FDQFALYKEGDPAPYKNPERWYRASFPI ITVTAAMSGTYRCYSFSSRDPYLWSAPSDPLELVVTGTSVTP 

FDEFVLYKEGDTGPYKRPEKWYRANFPIITVTAAHSGTYRCYSFSSSSPYLWSAPSDPLVLVVTGLSATP 
150 160 170 180 190 200 210 
210 220 230 240 250 260 270 f 
inputs SRLPTEPPSSVAEFSEATAELTVSFTNKVFTTETSRSITTSPKESDSPAGPARQYYTKGNLVRICLGAVI 



SQVPTEESFPVTESSRRPSILP — TNKISTTEKPMNITASPEGLSPPIGFAHQHYAKGNLVRICLGATI 
220 230 240 250 260 270 

280 290 300 310 320 330 
i nputs LIILAGFLAEDWHSRRKRLRHRGRAVQRPLPPLPPLPQTRKSHGGQDGGRQDVHSRGLCS 



1 1 1 L LGLLAEDWHSRKKCLQHRMRALQRPL PPLP-LA- 
280 290 300 310 
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